Computer Science > Computation and Language

arXiv:1911.00536 (cs)
[Submitted on 1 Nov 2019 (v1), last revised 2 May 2020 (this version, v3)]

Title: DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

Authors: Yizhe Zhang, Siqi Sun, Michel Galley, Yen-Chun Chen, Chris Brockett, Xiang Gao, Jianfeng Gao, Jingjing Liu, Bill Dolan
Abstract: We present a large, tunable neural conversational response generation model, DialoGPT (dialogue generative pre-trained transformer). Trained on 147M conversation-like exchanges extracted from Reddit comment chains spanning 2005 through 2017, DialoGPT extends the Hugging Face PyTorch transformer to attain performance close to human in both automatic and human evaluation in single-turn dialogue settings. We show that conversational systems that leverage DialoGPT generate more relevant, contentful, and context-consistent responses than strong baseline systems. The pre-trained model and training pipeline are publicly released to facilitate research into neural response generation and the development of more intelligent open-domain dialogue systems.
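Because the pre-trained checkpoints were released through the Hugging Face model hub, the model can be queried in a few lines. The sketch below is illustrative rather than taken from the paper: it assumes the microsoft/DialoGPT-medium checkpoint name and the transformers AutoModelForCausalLM / AutoTokenizer API, and follows the model's convention of separating dialogue turns with the EOS token.

```python
# Minimal sketch of multi-turn generation with a released DialoGPT
# checkpoint via Hugging Face transformers. The checkpoint name and
# generation settings are illustrative assumptions, not prescribed
# by the paper.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/DialoGPT-medium")
model = AutoModelForCausalLM.from_pretrained("microsoft/DialoGPT-medium")

# DialoGPT concatenates dialogue turns, each terminated by the EOS token.
history = None
for user_text in ["Does money buy happiness?",
                  "What is the best way to buy happiness?"]:
    new_ids = tokenizer.encode(user_text + tokenizer.eos_token,
                               return_tensors="pt")
    input_ids = new_ids if history is None else torch.cat(
        [history, new_ids], dim=-1)
    # Greedy decoding for simplicity; the paper also reports results
    # with beam search and MMI-based reranking.
    history = model.generate(input_ids, max_length=1000,
                             pad_token_id=tokenizer.eos_token_id)
    reply = tokenizer.decode(history[:, input_ids.shape[-1]:][0],
                             skip_special_tokens=True)
    print("Bot:", reply)
```

Appending the model's output back onto the growing token history is what gives the single-turn-trained model its conversational context across turns.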
Comments: Accepted to the ACL 2020 system demonstration track
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:1911.00536 [cs.CL]
  (or arXiv:1911.00536v3 [cs.CL] for this version)
  https://doi.org/10.48550/arXiv.1911.00536
arXiv-issued DOI via DataCite

Submission history

From: Yizhe Zhang
[v1] Fri, 1 Nov 2019 18:16:54 UTC (323 KB)
[v2] Tue, 28 Apr 2020 05:45:19 UTC (422 KB)
[v3] Sat, 2 May 2020 07:09:50 UTC (325 KB)